Cs-621 Theory Gems
نویسندگان
چکیده
So far, we have seen streaming algorithms for two important variants of Lp-norm estimation problem: L0-norm estimation (the distinct elements problem) and L2-norm estimation. We also noted that the L1norm estimation problem (at least, when we do not allow element deletions) corresponds to just computing the length of the stream and thus can be trivially solved in O(log n) space. Therefore, the next natural step would be to try to approach the task of L∞-norm estimation, i.e., estimating the frequency of the most frequently occurring element. Unfortunately, as mentioned last time, one can show (and we will see it soon) that L∞-norm estimation requires Ω(m) space, which is completely prohibitive from the-point of view of streaming algorithms. This is rather disappointing, as ability to efficiently compute this very fundamental statistic of the data streams would be very valuable. Fortunately, despite this negative result, there is still hope for getting something useful here. Namely, as we will see later, the L∞-norm estimation problem instances that are used in the Ω(m) lowerbound are all corresponding to a situation in which every element (including the most frequent one) has only very few (in fact, at most two) occurrences in the stream. However, this kind of instances are not really that interesting from the-point of view of the intended applications of an L∞-norm estimation algorithm. The scenarios that we would really like to address are the ones in which the most frequent element appears with substantial frequency (think, e.g., about a router trying to detect a Denial-of-Service attack). As it turns out, once we reformulate our problem to allow it to not provide any meaningful answer when there is no very frequently appearing elements, the Ω(m) lowerbound does not hold anymore and one can obtain very satisfactory algorithms for such scenarios.
منابع مشابه
Cs-621 Theory Gems
That is, we want the hyperplane corresponding to (w, θ) to separate the positive examples from the negative ones. As we already argued previously, wlog we can constrain ourselves to the case when θ = 0 (i.e., the hyperplane passes through the origin) and there are only positive examples (i.e., l = 1, for all j). Last time we presented a simple algorithm for this problem called Perceptron algori...
متن کاملCs-621 Theory Gems
In Lecture 10, we introduced a fundamental object of spectral graph theory: the graph Laplacian, and established some of its basic properties. We then focused on the task of estimating the value of eigenvalues of Laplacians. In particular, we proved the Courant-Fisher theorem that is instrumental in obtaining upper-bounding estimates on eigenvalues. Today, we continue by showing a technique – s...
متن کاملCs-621 Theory Gems
Today, we will briefly discuss an important technique in probability theory – measure concentration. Roughly speaking, measure concentration corresponds to exploiting a phenomenon that some functions of random variables are highly concentrated around their expectation/median. The main example that will be of our interest here is Johnson-Lindenstrauss (JL) lemma. The JL lemma is a very powerful ...
متن کاملCs-621 Theory Gems
As we mentioned in the last lecture, one of the core components of our algorithm for distinct elements problem is a space-efficient construction of 2-wise independent functions. Formally, a k-wise independent hash function f : [m]→ [T ] is a randomized function that provides the guarantee that, for any k distinct elements j1, . . . , jk ∈ [m] and any k possible values t1, . . . , tk ∈ [T ], the...
متن کاملCs-621 Theory Gems 1 Learning Non-linear Classifiers
In the previous lectures, we have focused on finding linear classifiers, i.e., ones in which the decision boundary is a hyperplane. However, in many scenarios the data points cannot be really classified in this manner, as there simply might be no hyperplane that separates most of the positive examples from the negative ones see, e.g., Figure 1 (a). Clearly, in such situations one needs to resor...
متن کاملCs-621 Theory Gems 2 the Stock Market Model
The main topic of this lecture is the learning-from-expert-advice framework. Our goal here is to be able to predict, as accurately as possible, a sequence of events in a situation when our only information about the future is coming from recommendations of a set of “experts”. The key feature (and difficulty) of this scenario is that most – if not all – of these experts (and thus their recommend...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012